Which gene did you mean?
Computational biology needs computer-readable information records. Increasingly, meta-analysed and pre-digested information is being used in the follow-up of high-throughput experiments and other investigations that yield massive data sets. Semantic enrichment of plain text is crucial for computer-aided analysis. In general, people think of semantic tagging as just another form of text mining, a term that has quite a negative connotation in the minds of some biologists who have been disappointed by classical text-mining approaches. Efforts so far have tried to develop tools and technologies that retrospectively extract the correct information from text, which is usually full of ambiguities. Although remarkable results have been obtained in experimental circumstances, the widespread use of information mining tools is lagging behind earlier expectations. This commentary proposes to make semantic tagging an integral process in electronic publishing.
The VODAN IN: support of a FAIR-based infrastructure for COVID-19
Molecular Technology and Informatics for Personalised Medicine and Healt
Towards Customizable Chart Visualizations of Tabular Data Using Knowledge Graphs
Scientific articles are typically published as PDF documents, thus rendering the extraction and analysis of results a cumbersome, error-prone, and often manual effort. New initiatives, such as ORKG, focus on transforming the content and results of scientific articles into structured, machine-readable representations using Semantic Web technologies. In this article, we focus on tabular data of scientific articles, which provide an organized and compressed representation of information. However, chart visualizations can additionally facilitate their comprehension. We present an approach that employs a human-in-the-loop paradigm during the data acquisition phase to define additional semantics for tabular data. The additional semantics guide the creation of chart visualizations for meaningful representations of tabular data. Our approach organizes tabular data into different information groups which are analyzed for the selection of suitable visualizations. The set of suitable visualizations serves as a user-driven selection of visual representations. Additionally, customization for visual representations provides the means for facilitating the understanding and sense-making of information.
Broadening the Scope of Nanopublications
In this paper, we present an approach for extending the existing concept of
nanopublications --- tiny entities of scientific results in RDF representation
--- to broaden their application range. The proposed extension uses English
sentences to represent informal and underspecified scientific claims. These
sentences follow a syntactic and semantic scheme that we call AIDA (Atomic,
Independent, Declarative, Absolute), which provides a uniform and succinct
representation of scientific assertions. Such AIDA nanopublications are
compatible with the existing nanopublication concept and enjoy most of its
advantages such as information sharing, interlinking of scientific findings,
and detailed attribution, while being more flexible and applicable to a much
wider range of scientific results. We show that users are able to create AIDA
sentences for given scientific results quickly and at high quality, and that it
is feasible to automatically extract and interlink AIDA nanopublications from
existing unstructured data sources. To demonstrate our approach, a web-based
interface is introduced, which also exemplifies the use of nanopublications for
non-scientific content, including meta-nanopublications that describe other
nanopublications. Comment: To appear in the Proceedings of the 10th Extended Semantic Web
Conference (ESWC 2013)
Time-resolved photoelectron and photoion fragmentation spectroscopy study of 9-methyladenine and its hydrates: a contribution to the understanding of the ultrafast radiationless decay of excited DNA bases.
The excited-state dynamics of the purine base 9-methyladenine (9Me-Ade) have been investigated by time- and energy-resolved photoelectron imaging spectroscopy and mass-selected ion spectroscopy, in both vacuum and water-cluster environments. The specific probe processes used, namely a careful monitoring of time-resolved photoelectron energy distributions and of photoion fragmentation, together with the excellent temporal resolution achieved, enable us to derive additional information on the nature of the excited states (ππ*, nπ*, πσ*, triplet) involved in the electronic relaxation of adenine. The two-step pathway we propose to account for the double exponential decay observed agrees well with recent theoretical calculations. The near-UV photophysics of 9Me-Ade is dominated by the direct excitation of the ππ* (1Lb) state (lifetime of 100 fs), followed by internal conversion to the nπ* state (lifetime in the ps range) via conical intersection. No evidence for the involvement of a πσ* or a triplet state was found. 9Me-Ade–(H2O)n clusters have been studied, focusing on the fragmentation of these species after the probe process. A careful analysis of the fragments allowed us to provide evidence for a double exponential decay profile for the hydrates. The very weak second component observed, however, led us to conclude that the photophysics were very different compared with the isolated base, assigned to a competition between (i) a direct one-step decay of the initially excited state (ππ* La and/or Lb, stabilised by hydration) to the ground state and (ii) a modified two-step decay scheme, qualitatively comparable to that occurring in the isolated molecule.
Provenance-Centered Dataset of Drug-Drug Interactions
Over the years several studies have demonstrated the ability to identify
potential drug-drug interactions via data mining from the literature (MEDLINE),
electronic health records, public databases (Drugbank), etc. While each one of
these approaches is properly statistically validated, they do not take into
consideration the overlap between them as one of their decision making
variables. In this paper we present LInked Drug-Drug Interactions (LIDDI), a
public nanopublication-based RDF dataset with trusty URIs that encompasses some
of the most cited prediction methods and sources to provide researchers a
resource for leveraging the work of others into their prediction methods. As
one of the main issues to overcome the usage of external resources is their
mappings between drug names and identifiers used, we also provide the set of
mappings we curated to be able to compare the multiple sources we aggregate in
our dataset. Comment: In Proceedings of the 14th International Semantic Web Conference
(ISWC) 201
Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open Science Cloud
The FAIR Data Principles propose that all scholarly output should be Findable, Accessible, Interoperable, and Reusable. As a set of guiding principles, expressing only the kinds of behaviours that researchers should expect from contemporary data resources, how the FAIR principles should manifest in reality was largely open to interpretation. As support for the Principles has spread, so has the breadth of these interpretations. In observing this creeping spread of interpretation, several of the original authors felt it was now appropriate to revisit the Principles, to clarify both what FAIRness is, and is not.
Fluoxetine effects assessment on the life cycle of aquatic invertebrates
Fluoxetine is a serotonin re-uptake inhibitor, generally used as an antidepressant. It is suspected to provoke substantial effects in the aquatic environment. This study reports the effects of fluoxetine on the life cycle of four invertebrate species: Daphnia magna, Hyalella azteca and the snail Potamopyrgus antipodarum, exposed to fluoxetine-spiked water, and the midge Chironomus riparius, exposed to fluoxetine-spiked sediments. For D. magna, a multi-generational study was performed in which newborns from exposed organisms were themselves exposed. Effects of fluoxetine could be found at low measured concentrations (around 10 µg l⁻¹), especially for parthenogenetic reproduction of D. magna and P. antipodarum. For daphnids, newborn length was impacted by fluoxetine and the second generation of exposed individuals showed much more pronounced effects than the first one, with a NOEC of 8.9 µg l⁻¹. For P. antipodarum, a significant decrease in reproduction was found for concentrations around 10 µg l⁻¹. In contrast, we found no effect on the reproduction of H. azteca but a significant effect on growth, which resulted in a NOEC of 33 µg l⁻¹, expressed in nominal concentration. No effect on C. riparius could be found for measured concentrations up to 59.5 mg kg⁻¹. General mechanistic energy-based models showed poor relevance for data analysis, which suggests that fluoxetine targets specific mechanisms of reproduction.
Mining microarray datasets aided by knowledge stored in literature
DNA microarray technology produces large amounts of data. For data mining
of these datasets, background information on genes can be helpful.
Unfortunately, most information is stored in free text. Here, we present an
approach to use this information for DNA microarray data mining.